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i 1 ABSTRACT 

*-£h ■ We present the first results of a combined VLT VIMOS integral-field unit and Hubble Space 

Telescope (HST)/ACS study of the early-type lens galaxy SDSS J2321-097 at z = 0.0819, 
Q . extending kinematic studies to a look-back time of 1 Gyr. This system, discovered in the Sloan 

i—{ • Lens ACS Survey, has been observed as part of a VLT Large Programme with the goal of 

C/3 \ obtaining two-dimensional stellar kinematics of 17 early-type galaxies to z ~ 0.35 and Keck 

spectroscopy of an additional dozen lens systems. Bayesian modelling of both the surface 
brightness distribution of the lensed source and the two-dimensional measurements of velocity 
, and velocity dispersion has allowed us, under the only assumptions of axisymmetry and a two- 

J> ■ integral stellar distribution function (DF) for the lens galaxy, to dissect this galaxy in three 

OO ' dimensions and break the classical mass-anisotropy, mass-sheet and inclination-oblateness 

degeneracies. Our main results are that the galaxy (i) has a total density profile well described 
by a single power- lawp oc r -y with Y = 2.06+° 

is a very slow rotator (specific stellar 
angular momentum parameter - 0.075); (iii) shows only mild anisotropy (6 » 0.15); and 
— . (iv) has a dark-matter contribution of ~ 30 per cent inside the effective radius. Our first results 

t— I ' from this large combined imaging and spectroscopic effort with the VLT, Keck and HST show 

t — ! that the structure of massive early-type galaxies beyond the local Universe can now be studied 

■ in great detail using the combination of stellar kinematics and gravitational lensing. Extending 

these studies to look-back times where evolutionary effects become measurable holds great 

• i— i , promise for the understanding of formation and evolution of early-type galaxies. 

, 

j_j ■ Key words: gravitational lensing — techniques: spectroscopic — galaxies: elliptical and 

C$ ' lenticular, cD — galaxies: kinematics and dynamics — galaxies: structure. 



1 INTRODUCTION 

Within the hierarchical galaxy formation scenario, early-type 
galaxies are assumed to be the end-products o f major mergers 
with additional accretion of smaller galaxies (e.g. lBurkert & Naaa 
|2004| for a review). As such the study of their structure, forma- 
tion and evolution is an essential step in understanding galaxy for- 
mation and the standard A cold dark matter (ACDM) paradigm, 
in which the hierarchical galaxy formation model has its foun- 
dations. Early-type galaxies are observed to follow tight scaling 
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relations between their stellar population, dark matter, kinematic 
and black hole p roperties, the origins of w h ich are still not well 
understood (e.g . Djorgovski & Davis |l987t I Dress ier et "all 1 19871: 
iMagorrian et al.lll998l:lBolton et alj|2007l) . 

Considerable observational progress has been made in 
the last decades in our understanding of the relative contribu- 
tions of baryonic (mostly stellar), dark matter and black hole 
constituents of early-type galaxi es through stell a r dynamical 
tracer s and X-ray studies (e.g. iFabbianq 1989; Moul deTal] 
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More recently, the SAURON collaboration dde Zeeuw et al j |2002h 
has extensively studied a large and uniform sample of early-type 
galaxies in the local Universe (i.e. z g 0.01), combining SAURON 
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integral-field spectroscopy (IFS) on the William Herschel Tele- 
scope with high-resolution Hubble Space Telescope (HST) imaging 
to obtain a three-dimensional picture of these galaxies in terms of 
their mass and kinematic s tructure dEmsellem et al.l |2004| . |2007| ; 
ICappellari et al]200dl2007h . 

Despite making a great leap forward, the complex modelling 
of the SAURON systems and the unknown dark matter content and 
total mass of these galaxies still require some assumptions to be 
made about their gravitational potential [e.g. constant stellar mass- 
to-light ratio {MIL) from stellar population models] and inclina- 
tion. These issues can be partly overcome in discy systems where 
the inclination can be determined from the photometry or possibly 
from dust lanes. However, systems with lower ell ipticity can show 
stronger degeneracies (e.g. ICappellari et "all l2007h . Also, near the 
effective radiu s dark matter becomes a major contributor to th e stel- 
lar kinematics (Gerha rd et al.l200~ll ; lTreu & Koopmans! 2004) and it 
is not yet clear what impac t its neglect has, even though it is esti- 
mated to be not too severe ( Cappe llari et al.ll2006h . 

At higher redshifts, the extraction of detailed kinematic infor- 
mation from early-type galaxies becomes increasingly more dif- 
ficult because of the decrease in apparent size and cosmological 
surface brightness dimming. However, galaxies at higher redshifts 
are much mo re likely to act as gravitational lenses on background 
galaxies (e.g. lTurner. Ostriker. & Gotti ri984). The additional infor- 
mation obtained from gravitational lensing on, for example, the 
enclosed mass and the density profile has been shown to break 
some of the classical degeneracies in stellar kinematics and lens- 
ing, namely the mass-anisotropy and the mass-sheet degeneracies , 
respectively (e.g. | Treu & Ko opmans 2002a; [Koopmans et al. 2003; 
Koopmans & Treu 2003; Treu & Koopmans 2004; Koopmans et al. 
2006; Barnabe & Koopmans 2007], hereafter BK07). 



Tabl e 1. Basic data fo r SP SS J2321-097. The da ta are taken 
from lTreu et"al] l2006id) and lKoopmans et alj J2006h . 



In order to use gravitational lensing and stellar dynamics in a 
systematic way, the Lenses Structure & Dynamics (LSD) Survey 
was started in 2001, combining HST imaging da ta on lens systems 
with stellar kinematics obtained wit h Keck dKoopmans & Tret] 
2002, 2003l; lTreu & Koopmansll20023lbll2003L l2004). This project 
and its results, predominantly at z ~ 0.5-1.0, have shown the va- 
lidity of this methodology and its effectiveness at higher redshifts. 
These high-redshift systems, however, are relatively rare and hard 
to follow up, which has limited studies to the use of luminosity- 
weighted stellar velocity d ispersions (e.g. lKoopmans & Treul2002l ; 
iTreu & Koopmansll2002bl) or long-slit s pectroscopy in a few large 
apertures along the ma jor axes (e.g. iKoopmans & Trevj 1200 3l ; 
ITreu & Koopmansll2004l) . 

The Sloan Lens ACS Survey (SLACS, iBolton et all I2005L 
l2006h was begun to extend the sample of lens galaxies suit- 
able for joint lensing and dynamical studies. SLACS lens can- 
didates were selected from t he SDSS Luminous Red Galaxy 
sample dEisenstein et alj l200ll) and a quiescent subsample [de- 
fined by EW(Ha) < 1.5 A] of the MAIN SDSS galaxy sample 
dStraus s et al. 20 02T) by an inspection of their SDSS fibre spec- 
troscopy dBolton et al. [ |2004h . Galaxies whose spectra presented 
emission lines at a higher redshift than the redshift of the galaxy 
itself were observed in an HST snapshot programme to check 
whether the source of the emission lines was a gravitationally 
lensed background galaxy. To date, SLACS has confirmed around 
80 lens systems (Bolton et al., in preparation). In contrast to most 
earlier lens surveys which drew their candidate systems from cat- 
alogues of potential sources (e. g. quasi-stellar obje cts or radio 
sources as in the CLASS survey, iBrowne et alj [2003), SLACS is 
a Zens-selected survey and as such preferentially finds systems with 
bright lens galaxies but comparatively faint sources. It is therefore 
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an ideal starting point for detailed investigations into the properties 
of the lens galaxies. 

In this paper we introduce a follow-up project to SLACS 
which aims at combining the lensing information with detailed 
two-dimensional kinematic information obtained with the VIMOS 
integral-field unit (IFU) mounted on the VLT. The full sample com- 
prises 17 systems. In this paper we demonstrate the available data 
and the analysis methodology on one system, SDSS J2321-097. 
The lens in this system is an early-type galaxy at z = 0.0819. 
The SDSS spectrum shows additional [O n] and H/3 e mission from 
a gra vitationally lensed blue galaxy at z s = 0.5342 (Bolt on et al.1 
2006). In HST imaging (Fig. [TJ the lensed source is shown to 
form an almost complete Einstein ring of radius 1.68arcsec, cor- 
responding to an Einstein radius of i?Einst = 2.6 kpc at the lens 
redshift. The projected mass enclosed within the Einstein radius 
is M(< R Ems t) = 1-3 x 10" Mq. Details on this system are listed in 
Table □ 

In Section [2] we describe the VIMOS and HST observations 
that form the basis of this analysis. The IFU data are presented in 
Section [3] with a description of the data reduction in Section [3~T| 
and of the extraction of two-dimensional kinematic maps in Sec- 
tion [3]2] In Section [3~3l we determine several commonly used pa- 
rameters from the kinematic data alone, in a way that is directly 
comparable to earlier studies, for example from the SAURON col- 
laboration. We apply the joint lensing and dynamics analysis de- 
veloped by BK07 in Section [4] In Section [5] we present a phys- 
ical argument that explains why gravitational lensing and stellar 
dynamics can break the degeneracy between oblateness and incli- 
nation. We discuss our results and conclude in Section [6] We use 
a concordance ACDM model throughout this paper, described by 
£1 M = 0.3, fi A = 0.7 and H = 100 h km s" 1 Mpc" 1 with h = 0.7 un- 
less stated otherwise. At the redshift of SDSS J2321-097, 1 arcsec 
corresponds to 1.1 h kpc = 1.5 kpc. 
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Figure 1. HST/ ACS images of SDSSJ232 1-097, taken through the F435W (left-hand panel) and F8I4W (right-hand panel) filters. The images are single 
exposures w i th an integration time of 420 s; the pixel scale is 0.05 arcsec. A full description of these observations and the data-reduction process is given in 
iBolton et alJfcOOd) . 



2 OBSERVATIONS 

In this section, we briefly discuss the VLT VIMOS/IFU spectro- 
scopic and the HST imaging observations, which form the basic 
data sets for the subsequent modelling. 



2.1 Integral-field spectroscopy 

We have obtained integral-field spectros copic observations of 
17 systems using the IFU of VIMOS jLe Fevre et ail 1200 ll) 
mounted on the VLT UT3 (Melipal). The observations were con- 
ducted in two parts: three systems were observed in a pilot pro- 
gramme (ESO programme 075.B-0226; PI: Koopmans) and the re- 
maining 14 systems in a large programme (177.B-0682; PI: Koop- 
mans) with a slightly different observational setup. Observations 
were carried out in service mode throughout to ensure uniform 
observing conditions. In this paper we report in detail on the ob- 
servations of the pilot programme and the analysis of one system, 
SDSS J2321-097. The observations of the large programme and 
the analysis of the full sample will be presented in forthcoming pa- 
pers in this series. 

The IFU head of VIMOS consists of a square array of 80 x 80 
microlenses with a sampling of 0.67 arcsec per spatial element 
('spaxel') at the Nasmyth focus of the telescope. In high-resolution 
mode only the central 1600 lenses are used, resulting in a field of 
view of 27 x 27 arcsec 2 . The lenses are coupled to optical fibres 
which transmit the light to pseudo-slit masks, each of which con- 
tains 400 fibres and corresponds to a quadrant of the original field. 
The seeing limit for the observations was set to 0.8 arcsec, so that 
each fibre of the IFU gives a spectrum that is essentially indepen- 
dent of its neighbours. 

For the pilot programme, we employed the HR-Blue grism 
which covers a wavelength range from 4000 to 6200 A (the exact 
wavelength range varies slightly from quadrant to quadrant). The 
spectral resolution of the HR-Blue grism is A/AA = 2550, where 
AA is the full width at half-maximum (FWHM) of the instrument 
profile. The mean dispersion on the CCD is 0.5 A per pixel. Three 
exposures of 555 s integration time were obtained per observing 
block (a one-hour sequence of science and calibration exposures); 



the telescope was shifted by about 3.5 arcsec between the expo- 
sures to fill in on dead fibres. The number of observing blocks 
spent on each lens system was adjusted so as to reach approxi- 
mately equal signal-to-noise ratios (5 /AO across the sample. For 
the system described in this paper, SDSS 12321-097, five observ- 
ing blocks were carried out for a total integration time of 8325 s. 
For the large programme, we switched to the HR-Orange grism 
which covers a wavelength range from 5250 to 7400 A at a simi- 
lar spectral resolution of A/AA = 2650 and a dispersion of 0.6 A 
per pixel. The spectral resolution translates to a rest-frame velocity 
resolution An varying between 107 and 85 km s -1 , improving as the 
redshift of our sample lens galaxies increases from 0.08 to 0.35. 

2.2 HST imaging 

High-resolution imaging with HST/ ACS or tf5T/WF PC2 has been 
or is being obtained for al l confirmed SLACS lenses (Bolt on et al.l 
l2006l : lGavazzi et al.ll2007l) . Th e data used here com e from the ACS 
snapshot survey described by iBolton et al.l d2005l) and consist of 
single 420 s exposures through the F435W and F814W filters, re- 
spectively. Fig.[T]shows the images of SDSS J2321-097. The lens 
modelling requires as input a clean estimate of the brightness dis- 
tribution of the lensed images, so the smooth brightness distri- 
bution of th e len s galaxy has to be subtracted from the images. 
IBolton et al.l d2006h tried a variety of commonly used modelling 
techniques (e.g. fitting of Sersic profiles), but settled on the more 
flexible B-spline technique. The galaxy-subtracted image in the 
F814W filter is shown in the top right-hand panel of Fig.|7] 



3 IFU DATA ANALYSIS 
3.1 Data reduction 

The VLT VIMOS/IFU data were reduced using the vipgi package 
which was developed within the framework of the VIRMOS con- 
sortium and the VVDS project. vipgQ has been described in detail 

1 http://cosmos.iasf-milano.inaf.it/pandora/vipgi.html 
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Figure 2. Left-hand panel: VLT VIMOS/IFU image reconstruction of SDSS J232 1-097, obtained by integrating along the individual fibre spectra from 4400 
to 6100 A. Right-hand panel: map of the strength of the [On] doublet from the lensed source. 



bv lScodeggio et all J2005h and lZanichelli et al.l J2005h ; here, we re- 
strict ourselves to a brief summary of the main reduction steps per- 
formed by vipgi. 

The four quadrants of the IFU are treated independently up to 
the combination of all exposures into the final data cube. For each 
night on which observations were carried out, five bias exposures 
were obtained which are median-combined and subtracted from the 
data. The observed halogen-lamp illuminated screen flats are used 
to locate the traces of the individual spectra on the CCD, starting 
from interactively adjusted first guesses of the layout of the fibres. 
This includes a shift correction and an identification of 'bad' fibres. 
The position of the peak of each spectrum in the spatial direction 
is then modelled as a quadratic polynomial function of the position 
in the dispersion directio n, vipgi does not perform a CCD pixel-to- 
pixel flat-field correction (Scodeggio et all2005h . 

The wavelength calibration is subsequently done using he- 
lium and neon lamp observations taken immediately after the sci- 
ence exposures. With about 20 lines spread between ~ 4000 and 
~ 6200 A, the rms deviations from the polynomial fit are roughly 
Gaussian-distributed around a mean of ~ 0.075 A and a dispersion 
of ~ 0.02 A. The velocity error introduced by uncertainties in the 
wavelength calibration is thus of the order of 5 km s -1 , significantly 
smaller than the spectral resolution and negligible in our analysis. 

The individ ual spectra a re extracted using the optimal extrac- 
tion scheme o f iHornel ( fl986h . that is, by averaging the flux across 
the spectral trace of the fibre, weighted by a fit to its spatial pro- 
file. In order to correct for the transmissivity variations from fibre 
to fibre ('spaxel-to-spaxel flat-fielding'), the flux under a sky emis- 
sion line is measured and scaled relative to a reference fibre. We 
use [O i] at A = 5577 A for this purpose. 

Spectrophotometric standard stars were observed during most 
nights. Since the stellar light only covers a small fraction of the fi- 
bres in the IFU field whereas the calibration is applied to all fibres, 
the spectrophotometric calibration only accurately corrects the rel- 
ative wavelength dependence of the sensitivity function (assuming 
that this is the same for all fibres), while the absolute flux levels are 
determined with a much larger uncertainty due to residuals from 
the correction of the varying fibre transmissivities. 

The VIMOS IFU does not have dedicated sky fibres, but the 
field of view is large enough for a sufficient number of fibres to es- 



sentially measure pure sky. vipgi identifies these fibres in a statisti- 
cal way by integrating the flux in each fibre over a large wavelength 
range and selecting as sky fibres those with a flux within ±0. 1 per 
cent of the mode of the resulting distribution. 

Finally, the four quadrants of all the individual exposures are 
combined into a single data 'cube' after correction for the dithering 
between exposures. The final data are in row-stacked form, where 
each row corresponds to the spectrum from one fibre. A fits table 
extension gives the translation from row number to the spatial co- 
ordinate of the fibre. 

The left-hand panel of Fig. [2] shows an image of 
SDSS J2321-097, reconstructed from the reduced data cube using 
the program sadicQ by integrating along the wavelength direction 
from 4400 to 6100 A. The galaxy surface brightness distribution 
agrees well with that seen in the HST/ACS F435W and F814W 
snapshot images, although it is not as precise due to the complexi- 
ties involved in the spectral calibration, in particular of the relative 
fibre transmissivity. In the modelling of the lens galaxy brightness 
distribution, we therefore use the HST images. The imperfect abso- 
lute flux calibration of the IFU data has no effect on the measure- 
ments of the stellar kinematics which do not depend on the overall 
scaling of each fibre spectrum. 

Following lBolton & Buriesl d2007h . we can use the IFU data to 
obtain an image of the lensed source by isolating its redshifted [O n] 
emission line at 5710 A. The [O n] doublet is modelled as the sum 
of two Gaussians with separation 4.5 A (taking into account the 
redshift of the source) and relative strength 3:2. In regions with sig- 
nificant light from the lens galaxy the best-fitting kinematic model 
(Section l3.2t was subtracted prior to the fit; elsewhere a polyno- 
mial fit was subtracted to remove a remaining continuum level. The 
map of the line strength of the [O n] line obtained in this manner is 
presented in the right-hand panel panel of Fig. [2] The map clearly 
shows the ring-like structure of the lensed galaxy (cf. the higher res- 
olution galaxy-subtracted ACS image in the top right-hand panel of 
Fig. [7} and demonstrates that the [O n] line used for the inclusion 
of this system in the SLACS sample indeed originates from a grav- 
itationally lensed background galaxy. 



http://cosmos.iasf-milano.inaf.it/pandora/sadio.html 
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Figure 3. Spectram of SDSS J2321-097 integrated from the VIMOS/IFU data cube over a circular aperture of diameter 3 arcsec. The inset on the right-hand 
side shows a zoom on the [On] doublet of the lensed background galaxy at z s = 0.5342, next to the FeCa /15268 line of the lens. The inset on the left-hand 
side shows a zoom on the weak but clearly detected [O n] line of the lens galaxy. Several other spectral features of the lens are marked. 



The spectral energy distribution of SDSS J2321-097 is essen- 
tially that of a typical early-type galaxy with strong metal lines, as 
seen in Fig. [3] which shows the spectrum integrated over a circu- 
lar aperture of 3 arcsec diameter (i.e. the SDSS aperture). We note, 
however, that this spectrum shows weak but significant [O n] emis- 
sion (EW 4 A) as well as US absorption from the lens galaxy. 

This might indicate residual star formation or active galactic nu- 
cleus activity at a low level. 



3.2 Kinematic analysis 

We extract two-dimensional maps of systematic velocity v(R) 
and line-of-sight velocity dispersion cr(R) using a direct pixel- 
fitting method implemented in the statistical language 
(R Development Core Teamll2007l) . Since the analysis is done on 
spectra from individual IFU fibres, the spectra have relatively 
low signal-to-noise ratio, in particular in the outskirts of the lens 
galaxy where the surface brightness drops rapidly. We therefore 
restrict the model to a Gaussian line-of-sight velocity distribu- 
tion (LOSVD) and do not attempt to measure higher order mo- 
ments, such as the Gauss-Hermite coefficients h 3 and h 4 commonly 
used in kinematic studies of low-redshift early-type galaxi es (e.g. 



. . 3e g a 

Ivan der Marel & Franxlll993l:ICappellari & Emsellernll2004l) . 

The observed galaxy spectrum dj(A) is modelled, for each fi- 
bre i separately, by the convolution of a high-resolution stellar tem- 
plate spectrum f(ln/i) with a Gaussian kernel gr,(hi/l) with disper- 
sion cr„jlc: 



d,(A) = f,(A)[t®g,](\nA) + n I (A) . 



(1) 



We assume Gaussian noise n(A) on the spectrum. Since the accu- 
racy of the spectrophotometric calibration is not perfec t for both our 
spect ra and the chosen template spectra (see below and Val des et"al] 
120041) there remain small relative tilts between the continua of the 
galaxy and template spectra. We model this difference by the linear 
correction function /(A) = a + bA whose parameters are determined 



by a linear fit nested within the non-linear optimization for the kine- 
matic parameters t), and cr 0(i , 

The template spectrum is first resampled to a logarithmic 
wavelength scale with step A In A = Au/c, where we use a velocity 
step of An = 25 km s~' . The convolution kernel is a Gaussian with 
dispersion cr G = crJAv, extending to ±8cr G - The galaxy spectrum 
is corrected for the mean redshift of the lens galaxy, z = 0.0819 in 
the case of SDSS J2321-097, taken from the SDSS data base. The 
model is then resampled on to the wavelength grid of the galaxy 
spectrum, a linear wavelength scale A obs /(l + Zsdss) in our case. 
This avoids further resampling of the data beyond what was done in 
the data reduction. Similar methods typically resample both model 
and da ta to the same logarithmic wavelength grid before comparing 
(e.g. lvan der Marellll994l ; ICappellari & Emselleml2004h . 

Remaining cosmic rays, emission lines, and remnants of sky 
lines are masked automatically by 3<r-clipping of the residuals from 
a preliminary fit without mask. Fig. [4] shows an example of a fit to 
a spectrum from a fibre near the centre of SDSS J2321-097. Error 
analysis is performed through a Monte Carlo technique by adding 
Gaussian noise to the best-fitting model at the noise level given by 
the residuals between model and data, and using 1000 realizations 
to determine the spread of the best-fitting parameters. 

The choice of a good template spectrum is important 
for accurate kinematic measurements from galaxy spectra. For 
SDSS J232 1-097, we started by experimenting with a set of eight 
stellar template spectra obtained with the Echelle Spectrograph and 
Imager (ESI) on the Keck II telescope dKoopmans & Treull2002h 
applied to the global spectrum of SDSS J2321-097. These spectra 
have a velocity resolution of Auesi = 22.4km s~' (ESI manuaQ) 
and cover a range of spectral types GO to K4, luminosity class III. 
Fitting to the integrated spectrum of SDSS J232 1 -097 it was found 
that four templates for stellar types between G7 and K0 gave con- 
sistent results for the velocity dispersion within ±10kms~', which 
also agreed with the measurement from the SDSS. Cooler template 
stars (K2 and K4) resulted in a significantly higher velocity disper- 
sion while hotter templates (GO and G5) gave lower values. On the 



3 http://www.r-project.org/ 



4 http://www2.keck.hawaii.edu/inst/esi/echmode.html 
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Figure 4. Spectrum from a single fibre near the centre of SDSS J232 1-097, 
with the overlaid best-fitting model (top panel). The bottom panel shows 
the residuals of the fit. The [On] doublet from the source at 57 10 A (z s = 
0.5342) and the residual of the [O i] sky line at 5577 A were masked auto- 
matically (indicated by the points). 



basis of the visual appearance of the fit and the shape of the galaxy 
spectrum it was decided to use the K0 star HR 19 as the template 
star for SDSS J2321-097. 

We subsequently exchanged the ESI spectrum of HR 19 for the 
corresponding s pectrum from the Indo-US library of Coude feed 
stellar spectrc0 ( Valdes et al. 2004). The Indo-US library contains 
spectra for 1273 stars covering a wide range of stellar parameters. 
The spectra have a resolution of ~ 1 A FWHM and for the ma- 
jority of stars cover a wavelength range of 3460 to 9464 A. While 
the resolution of the Indo-US spectra is somewhat lower than that 
of the ESI spectra, this disadvantage is offset by the large number 
of spectra available, which will provide freedom of choice for the 
analysis of our full sample. We have checked that the kinematic re- 
sults obtained with the Indo-US and the ESI templates are indeed 
consistent. 

Fig. [4] shows an important limitation of any use of observed 
stellar spectra as templates in kinematic analyses, namely the dif- 
fering abundance ratios b etween stars in the solar ne ighbourhood 
and early-type galaxies ( iBarth, Ho. & Sargentl 120021) . While the 
strengths of the Fei/Ca 1/15268 lines agree, the Mgfc/15174 line is 

5 http://www.noao.edu/cflib/ 



significantly enhanced in the galaxy spectrum. IBarth et all d2002l) 
recommend masking the Mgb line before fitting. Doing this in- 
creases the velocity dispersion only by a few per cent and does not 
significantly improve the fit outside the Mg b region. 

3.3 Results from the observed stellar kinematics 

A fit of a kinematic model to the global spectrum (Fig.fJJ following 
the procedure described in Section fJ^ results in a velocity disper- 
sion of a m = 240 ± 8kms _1 , which is consistent wi th the value 
(XsDss = 236 ± 7kms~' listed in the SDSS data base (Bolt on et al.l 
2006). Note that the error estimates are not directly comparable 
because of differences in the estimation method. We believe our 
Monte Carlo estimate to be more reliable, although it might still un- 
derestimate the r eal error due to the as sumption of Gaussian noise 
in the spectrum dde Bruvne et al.ll2003l) . 

The two-dimensional maps of the velocity and velocity dis- 
persion are shown in Fig. [5] Only fibres in which the spectrum has 
an S/N per pixel in the rest-frame wavelength range from 5350 A 
to 5450 A (a flat part of the spectrum) of S/N > 8 are used in these 
maps and the subsequent analysis. Spectra with lower S/N were 
found not to yield reliable kinematic fits. 

Neither the velocity nor the velocity dispersion map shows 
significant structure. The velocity map might hint at a pattern of 
slight rotation, although within the noise the map is consistent with 
no rotation at all. 

In order to quantify the amount of rotation in a galaxy from 
two-dimensional maps of line-of-sight st ellar velocity v and veloc- 
ity dispersion <r„, lEmsellem et alj J2007T) defined a stellar angular 
momentum parameter A R by 



CRM) 



(2) 



where R is the projected distance from the galaxy centre. We find 
/1r = 0.075, which in combination with the absolute magnitude 
M B = -21.72 places SDSS J2321-097 among the slowly rotat- 
ing galaxie s at the high-luminosi ty end of the SAURON sample 
(cf. fig. 7 of lEmsellem et alj|2007l) . 

Following the prescription of iBinneyj (2005) and 
Cappellari et al.1 d2007f) . we find a low value for the ra- 
tio between systematic velocity and velocity dispersion of 
((u 2 )/(cr 2 )) 1/2 = 0.1, which in combination with the observed 
ellipticity of 6j so ,2D = 1 - qi so ,m = 0.23 from isophote fitting 
places SDSS J232 1-097 well below the curve for an isotropic 
rotating oblate galaxy in an e—(v/cr v ) diagram. From the ten sor 
virial theorem the global anisotropy parameter (Binney 2005) is 
found to be S ~ 0.15, where no correction for inclination has 
been made yet. In Section[4] we calculate the same quantity from 
the reconstructed stellar DF and find effectively the same result. 
Again, SDSS J232 1-097 compares well with the high-lum inosity 
galaxies from the SAURON sample dCappellari et alj|2007l) in that 
its ellipticity requires an anisotropic velocity distribution and is not 
due to rotation. 

The velocity dispersion is nearly constant across the field as 
one would expect for a nearly isothermal total mass distribution. 
Fig. [6] shows the velocity and velocity dispersion as a function of 
position along the major and minor axes (fibres within 1 arcsec of 
the respective axis were used). The major axis velocity cut shows a 
hint of the S-shape expected from rotation around the minor axis, 
although at a very low level. The velocity dispersion along the ma- 
jor axis is perfectly flat, whereas the minor axis cut shows a slight 
decline outwards. 
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Figure 5. Kinematic maps of SDSS J2321— 097. The top left-hand panel shows systematic velocity relative to the mean redshift of the lens galaxy; the bottom 
left-hand panel shows the velocity dispersion. The panels on the right-hand side visualize the con'esponding lcr errors on the measurements. Spaxel size is 
0.67 arcsec and only spectra with S/N > 8 were analysed. 
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Figure 6. Velocity (top panels) and velocity dispersion (bottom panels) as 
a function of position along the major (left-hand panels) and minor (right- 
hand panels) axes. Spectra from fibres within 1 arcsec of the respective axis 
were used here. 



To quantify this, we perform a linear fit to the velocity disper- 
sion profile as a function of the elliptical radius ji 2 = x 2 + y 2 /q 2 sa 2rj! 
with <5>i so _2D = 0.77 obtained by ellipse fitting to the light distri- 
bution. The linear slope is dcr/d/i = (-7.3 ± 4.5)kms~' arcsec - ', 
consistent with being fully isothermal. 



4 JOINT SELF-CONSISTENT GRAVITATIONAL 
LENSING AND STELLAR DYNAMICS ANALYSIS 

In this section, we combine the information from the gravitation- 
ally lensed image with the surface brightness distribution from HST 
observations and the projected velocity moment maps derived from 
VLT observations of the lens galaxy SDSS J232 1-097 in order to 
constrain the total mass density profile of this galaxy. The analysis 
is carried out by making use of the cauldron algorithm, presented 
in BK07: we refer to that paper for a detailed description of the 
method. 

4.1 Mass model and overview of the joint analysis 

The key idea of a self-consistent joint analysis is to adopt the 
same total gravitational potential O for both the gravitational lens- 
ing and the stellar dynamics modelling of the data. As shown in 
BK07, these two modelling problems, although different from a 
physical point of view, can be expressed in an analogous way as a 
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Figure 7. Best model lens image reconstruction for the system SDSS J232 1-097. From the top left-hand to bottom right-hand panel: reconstructed source 
model: HST/ ACS data showing the lensed image after subtraction of the lens galaxy; lensed image reconstruction; residuals. 



single set of coupled (regularized) linear equations. For any given 
choice j\ of the non-linear potential parameters, the equations can 
be solved non-iteratively to simultaneously obtain as the best so- 
lution for the chosen potential model the unlensed source surface 
brightness distributions and the weights of the elementary stellar 
dyna mics building blocks (e.g. orbits or two-integral c omponents, 
TICs. Ischwa rzschild 1 97^ ; IVerolme & de Zeeu wj |2002h . This lin- 
ear optimization scheme is consistently embedded in the frame- 
work of Bayesian statistics. As a consequence, it is possible to ob- 
jectively assess the probability of each model by means of the ev- 
idence merit function and, t herefore, to rank different models (see 
MacKavl [l992l . 1 19991 , 120031) . In this way, by maximizing the evi- 
dence, the set of non-linear parameters r\ which corresponds to the 
best potential model, given the data, can be recovered. 



While the method is completely general, its practical imple- 
mentation (in the cauldron algorithm) is more restricted to make 
it computationally efficient and applies specifically to axisymmet- 
ric potentials, <t>(R,z), and two-integral DFs / = f(E,L z ). Un- 
der these assumptions, the dynamical model can be constructed by 
making use of the fast BK07 numerical imple mentation of the two- 
inte gral Schwarzschild method developed bv lCretton et al] i 19991) 
and Verolme & de Zeeuwl J2002 b. whose building blocks are not or- 



bits (as in the classical Schwarzschild method) but TICs|j Among 
the outcomes of the joint analysis presented in this section is the 
weights map of the TIC superposition, which can be related to the 
weighted two-integral DF (see BK07). 

As a first-order model and in order to allow a straightfor- 
ward comparison with the results of the preliminary analysis of 
iKoopmans et al.l d2006l) . we adopt as the total mass density distri- 
bution of the galaxy a power law stratified on axisymmetric ho- 
moeoids: 



p(m) ■ 



Pa 
my' ' 



< / < 3, 



(3) 



where p is a density scale, y' will be referred to as the (logarithmic) 
slope of the density profile, and 



c z a 1 a-q l 



(4) 



with a > c and c/a = q. 



6 A TIC can be visualized as an elementary toroidal system, completely 
specified by a particular choice of energy E and axial component of the an- 
gular momentum L z . TICs have simple i/R radial density distributions and 
analytic unprojected velocity moments, and by superposing them one can 
build /(£, L z ) models for arbitrary spheroidal potentials (cf. Cret ton et al] 
1999): all these characteristics contribute to make TICs particularly valu- 
able and 'inexpensive' building blocks when compared to orbits. 
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Figure 8. Best dynamical model for the galaxy SDSS J232 1-097. First row: observed surface brightness distribution, projected line-of-sight velocity and 
line-of-sight velocity dispersion. Second row: corresponding reconstructed quantities for the best model. Third row: residuals. The low-level ripples in the 
surface brightness residuals are due to the discrete nature of the TICs. 



The (inner) gravitational potential associated with a ho- 
moeoi dal density distributio n p(m) is give n by Chandrasekhar's for- 
mulajChandrasekhar 1969; see also e.g. iMerritt & Fridmanlll996l 
and lCiotti & Bertinll2005h We find, for / t 2, 



<b(R,z) 



-4>o r 

Jo 



[!/(/- 2)] m 
(1 + r)^q 2 +T 



■dr . 



where ®o = 27iGqa 2 po and 

p2 ,2 

. i « z 



(5) 



(6) 



(7) 



a 2 (l+r) a 2 (q 2 + r)' 
For Y = 2 we have 

Jo (1 + r)^q 2 + T 

There are three free non-linear parameters in the potential to 
be determined via the evidence maximization: O ( or equivalently, 
through equation [B4] of BK07, the lens strength ag), the slope y' 
and the axial ratio q. One can also easily include a core radius ^ s 
by modifying the definition of the homoeoidal radius m, but for 
the purpose of the present analysis it was kept fixed to a negligibly 



small value. In addition to the previously mentioned parameters, 
there are four additional parameters which determine the geometry 
of the observed system: the position angle i?pa, the inclination i and 
the coordinates of the centre of the lens galaxy with respect to the 
sky grid. 

The cauldron algorithm is run on the data sets (and the corre- 
sponding covariance matrices) described in the previous sections. 

(i) The lensed image surface brightness distribution is defined 
on a 100 x 100 grid (1 pixel = 0.05 arcsec) on which the inner- 
most and outermost low-5 /N regions have been masked out, and 
the source is reconstructed on a 40 x 40 grid. To include the effects 
of seeing in the modelling, the lensing matrix is blurred with a blur- 
ring operator which takes into account the HS T/ACS F 814W point 
spread function (PSF) obtained with tiny tim dKristl 19931) . 

(ii) The lens galaxy surface brightness distribution is given on 
a 50 x 50 grid (1 pixel = 0.10 arcsec), while the kinematics data 
are given on a 9 x 9 grid (1 pixel = 0.67 arcsec), but masking out 
all the points with S/N less than 8. The dynamical modelling em- 
ploys 100 TICs, each one populated with Nnc = 1 x 10 5 particles. 
The TICs are selected by considering N E = 10 elements logarith- 
mically sampled in the radius R c of the circular orbit with energy 
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Table 2. SDSSJ232 1-097: non-linear parameters for the best 
power-law model p oc m~ r (equation [5) ■ 



Model: power law 




At best 




(' 


67? 8 


Non-linear 


ao 


0.468 


parameters 


Y 


2.061 




q 


0.739 




log -li en 


-2.344 


Hyperparameters 


io g 4*° 


-0.023 




io g 4 yn 


1.810 




Slen 


-78480 


Evidence 


Sdyn 


-14558 




£tot 


-93 038 



E c = E(R C ), and N L , = 5 elements linearly sampled in angular mo- 
mentum, mirrored for negative L z . The PSF is modelled as Gaus- 
sians with 0.10 and 0.90arcsec FWHM for the two grids, respec- 
tively. 



A curvature regularization (as described in ISuvuetal.ll2006l 
and appendix A of BK07) is adopted for both the gravitational 
lensing and the stellar dynamics reconstructions. As discussed in 
BK07, the initial guess values of the hyperparameters (which set 
the level of the regularization) are chosen to be quite large, since 
the convergence to the maximum is faster when starting from an 
overregularized system. Moreover, in order to further speed up the 
reconstruction, the position angle &p A and the coordinates of the 
lens galaxy centre £ c = (jc c , y c ) are fairly accurately determined 
by means of a preliminary optimization run and subsequently kept 
fixed or allowed to vary only within a limited range around the de- 
termined values. The position angle i?pa = 135°5 obtained in this 
way is very close to both the observed value for the light distribu- 
tion i?, = 126° 5 and the position angle i?sie = 1 36°2 obtained for 
the sin gular isothermal ellipsoid lensing model in Koopma ns et al.l 
(2006)? 



4.2 Results 

The non-linear parameters of the best-fitting model obtained from 
the combined gravitational lensing and stellar dynamics reconstruc- 
tion are presented in Table l4.2l together with the best-recovered hy- 
perparameters and the values of the evidence. The recovered incli- 
nation angle is i = 67° 8. The lens strength, proportional to the nor- 
malization constant of the potential Oo (see appendix B of BK07), 
is a = 0.468. The logarithmic slope and the axial ratio of the to- 
tal density distribution of the power-law model are y' = 2.061 and 
q = 0.739, respectively. 

The reconstructed observables corresponding to the best- 
fitting model are shown and compared to the data in Figs [7] and [8] 
for, respectively, gravitational lensing and stellar dynamics. Fig. [7] 
also displays the reconstructed source model, while the reconstruc- 
tion of the weighted two-integral DF of the lens galaxy is given in 
Fig.|9] 

The presence of an external shear was also examined, giving 
results consistent with zero. Furthermore, in order to assess the pos- 
sible effect of a faint galaxy located at (2'.' 55, 2'.' 10) north-west of 
the centre of the lens galaxy on the sky (Fig.QJ, a singular isother- 
mal sphere (SIS) centred on the corresponding position was added 



to the model, but the best reconstruction unambiguously excluded 
any relevance of this component (its lens strength is not significant). 

As a sanity check, we also performed the reconstruction with 
a different set-up for the dynamics, that is, employing a larger num- 
ber of TICs (N E = 18 and Nt. = 9) or increasing the number 
of populating particles to 7Y T i C = 12 x 10 5 while maintaining the 
same number of TICs. In both cases, the obtained values for the 
non-linear parameters do not differ significantly from the ones pre- 
sented in Table 14. 2l There is an improvement in the total evidence 
mainly because of the slightly better fit to the surface brightness 
distribution. 

The most remarkable result of the reconstruction is the loga- 
rithmic slope y ^ 2, making the total density profile very close to 
isothermal. This is i n good agreement with t he previous finding of 
y ^ 1.9 obta ined by Koopmans et al.l ( 120061) for the same system. 
The fact that [Koopmans et al.l J2006f) achieved robust results with 
a simpler and not fully self-consistent joint lensing and dynamics 
analysis (in which they only used the average cr\ os from the SDSS 
fibre spectroscopy as a constraint for the dynamics and solved the 
spherical Jeans equations) is a strong indication of the effectiveness 
of the combined analysis in pinning down the characteristics of the 
total potential. 

It should be stressed that the role of the kinematic constraints 
is crucial in breaking the ample degeneracies that would arise from 
a reconstruction limited to the lensing data: in particular, the fairly 
constant line-of-sight velocity dispersion map unyieldingly rules 
out values for the slope which are significantly far from isothermal, 
but which would turn out to be perfectly reasonable (even based on 
the quantitative evidence) if lensing alone was considered. 

The lens galaxy surface brightness distribution together with 
the line-of-sight projected velocity map (which shows a slight in- 
dication of rotation of the system) provide important information 
on the flattening and the inclination of the system, although these 
quantities remain the most difficult to disentangle. The weak (of the 
order of a few per cent) but discernible pattern in the residuals of 
the surface brightness reconstruction can be ascribed to the effect 
of the discreteness of the adopted TIC library, but could also be an 
indication of a slight offset in the orientation between the luminous 
matter distribution and the total potential: when the reconstruction 
is limited to lensing alone, the obtained position angle is typically 
a few degrees larger than & PA found for the joint reconstruction. 

Much information about the best reconstructed dynamical 
model is contained in the weighted two-integral DF presented in 
Fig. [9] (refer to appendix D of BK07 for a precise definition of the 
weights jj which are the quantity shown in this plot). Since all the 
original TICs have equal mass by construction, each pixel in the in- 
tegral space provides the relative contribution in mass of the corre- 
sponding TIC. The weighted DF is only slightly asymmetric around 
L z /L max = 0, which is reflected in the very low values of the rotation 
velocity of the best model system (cf. central panel of Fig. [8}. The 
very weak indication of counter-rotation which can be detected in 
the same panel is also more clearly exhibited in the weighted DF: 
for the higher R c the TICs with negative L- (the red pixels) show 
larger values than the corresponding (mirrored) TICs with positive 
L z , while the opposite behaviour is displayed for R B <; 1. Given the 
low S /TV, however, we do not place much significance on this pos- 
sible counter-rotation, although SAURON has disco vered several 
systems with such a behaviour dEmsellem et alj2004l) . 
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Figure 9. Reconstruction of the weighted two-integral DF of the lens galaxy 
SDSS J2321-097 obtained from the best-fitting model. Each pixel in the 
two-integral space gives the relative contribution of the corresponding TIC 
to the total mass of the modelled system. Here, L max indicates the maximum 
value of the angular momentum along the z axis: Lmax = Rc»c, where R c is 
the circular radius and v c (R c ) is the circular velocity corresponding to a total 
energy E(R„) = <D c ff(R c ,0) = *(R C ,0) + v 2 J2. 



4.3 Inferences from the phase-space distribution 

From the best reconstructed DF we calculate the axial ratio <7*,3d 
of the three-dimensional light distribution, which does not need to 
coincide with the axial ratio of the total density distribution p{m) 
since in our approach mass is not required to follow light. We adopt 
the definition 



<?;,3D 



= 2 



J P*z- dV 
Jp*R 2 dV' 



(8) 



where R and z are the usual cylindrical coordinates and V is the vol- 
ume. This definition has been chosen such that a density distribu- 
tion which is a function of the elliptical radius t 2 = R 2 +z 2 lq e \ 2 (e-g- 
a prolate or oblate ellipsoid of axial ratio q en ) gives <?*,3d = fci- For 
our best model we obtain an axial ratio q+^o = 0.847, which is 
rounder than the q of the total density distribution. From the ob- 
served isophotal two-dimensional axial ratio q-^^u = 0.77 (mea- 
sured by fitting ellipses to the surface brightness contours of the 
galaxy, Table [T} and from the best model inclination of 67° 8, one 
obtains a three-dimensional axial ratio of the stellar density of 
<7iso,3D = 0.72. This does not coincide with the value of q*,m cal- 
culated from the second moments of the luminosity density of the 
galaxy (equation [8}. The disagreement might indicate that the re- 
covered three-dimensional light distribution is not constant on el- 
lipsoidal surfaces or that there is a change in ellipticity as a function 

of radius. 

Following Cappella ri et al. I J2007h , we plot in Fig. 1 101 the ra- 
tio (v^/cr in the meridional plane between the mean rotation ve- 
locity around the z-axis and the local mean velocity dispersion 
<x 2 = (cr| + cr 2 + cr 2 )/3. This quantity provides an indication of 
the importance of the rotation with respect to the random motions 
for each position (R,z) in the galaxy. From the analysis of Fig. 1101 
SDSS J232 1-097 appears to be an overall slow rotator, although 
rotation seems to play a more significant role in the inner regions. 
Two small regions characterized by slight counter-rotation are lo- 
cated around the equatorial plane (z = 0). Due to the assumption 
of a two-integral DF, the cross-section of the velocity dispersion 
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Figure 10. Ratio (v v )/<r of the first and second velocity moments for the 
best reconstructed model of galaxy SDSS J2321— 097, plotted in the positive 
quadrant of the meridional plane. 



ellipsoids would be circular for every position on the meridional 
plane. 

Another helpful way to quantify the anisotropy of an axisym- 
metric gal axy is provided by the three glo bal anisotropy parameter s 
defined in Binn ev & Tremainel d 19871) and lCappellari et ai] d2007l) : 



p= 1 



y=l- 



where 
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2/J-y 



n H - = J pa 2 , d?x 



(9) 
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and cr k denotes the velocity dispersion along the direction k at any 
given location in the galaxy. For nearly spherical systems it can be 
convenient to consider the anisotropy parameter p r defined as 

Ik 



yS r = 1 



(13) 



in the spherical coordinates (r, 6, ip). Here n„ = (Tl m + ILJ)/2. 

For a two-integral DF, as mentioned above, a 2 R = cr 2 every- 
where, which implies YV RR = Tl, z . Therefore, the value of [} is al- 
ways zero, while y and 5 have opposite signs. Moreover, under this 
assumption it is easy to show that /?, = y/2. 

Calculating the non-zero anisotropy parameters for the best 
reconstructed model, we find y = -0.32, which indicates a slight 
tangential anisotropy, and S = 0.14. The latter value agrees well 
with the corresponding value determined directly from the ob- 
served data in Section [331 The anisotropy parameters found for 
SDSS J 2321-097 have v a lues w hich are consistent with the find- 
ings of ICappellari et a 1. (2007) for their most luminous galax- 
ies. It should be pointed out, however, that the comparison is not 
immediately straightforward, since their use of a three-integral 
Schwarzschild orbit-superposition method gives more flexibility to 
the models. 

We compared the radial profile of the total mass density of 
the galaxy (given by equation [3]by inserting the best-fitting model 
parameters) with the density of the stellar component alone, as ob- 
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Figure 11. Upper panel: mass enclosed within a radius r. The thick solid 
line shows the total mass of the galaxy as a function of r obtained from 
the best reconstructed model. The thick dashed line shows the same quan- 
tity for the luminous component obtained from the reconstructed DF and 
rescaled to the value which maximizes the contribution of luminous over 
dark matter ('maximum bulge'). As a comparison, the dash-dotted and dot- 
ted lines show the quantity M(r) for, respectively, a Jaffe and a Hernquist 
profile, normalized to the total luminous mass. Lower panel: logarithm of 
circularized mass density as a function of radius. The line styles have the 
same meaning as above. 



tained directly from the reconstructed DF. The axisymmetric den- 
sity distribution was circularized by averaging over spherical shells. 
The stellar density profile has been normalized by using as a con- 
straint the value M cff 2.0 x 10' 'Mq for the luminous mass in- 
side the effective radius, which is obtained from the observations 
when assuming an average M/L of 5.2 (M/L)q_ s . This 'maximum 
bulge' rescaling (analogous to the maximum disc approach for spi- 
ral galaxies) maximizes the contribution of the luminous compo- 
nent with respect to the total mass. The density and the enclosed 
mass as a function of radius are plotted in Fig.[TT] together wi th the 
same functions for the spherical Jaffe and Hernquist profiles ( Jaffe 
ll983l : lHernquistl [ 1990) normalized to the total mass 2M e g. While 
very simple, these models appear to provide a reasonably good ap- 
proximation to the light distribution. From the diagrams, one can 
note that the total mass density distribution follows the light in the 



inner regions up to a distance of about 5 kpc, where the contribu- 
tion of the dark matter component starts to become significant: at 
5 kpc the non- visible mass represents about 15 per cent of the total 
mass, rising to ~ 30 per cent at 10 kpc (the distance for which the 
enclosed stellar mass approximately equals M c ff, i.e. the distance 
corresponding to the unprojected effective radius) and ~ 40 per 
cent at 15 kpc. 

The spatial coverage of the data might appear quite limited 
when it comes to the study of the total density profile at radii of the 
order of 10 kpc, since the Einstein radius i?Einst - 2.6 kpc, the ef- 
fective radius R c h is of the order of 8 kpc (which however becomes 
~ 10 kpc when unprojected) and the integral-field kinematics does 
not extend much farther than 3 kpc. However, one should note that 
a fair amount of information comes also from those distant regions 
of the galaxy that are seen in projection along the line of sight. In 
order to verify this, we consider the intersection between a sphere 
of radius r, centred on the galaxy, and a cylinder oriented along the 
line of sight which has a radius equal to, respectively, the Einstein 
radius and the effective radius. In the first case, it is found that for 
r = i?Einst the mass enclosed within this volume corresponds to ap- 
proximately 75 per cent of M Einst (the total mass within the Einstein 
radius). In other words, one-fourth of the contribution to the grav- 
itational lensing comes from matter which is located farther than 
2.6 kpc but falls, in projection, within the Einstein radius. In the 
second case, we obtain that for r = R e g the enclosed light is about 
77 per cent of L e fj, that is, again almost one-fourth of the luminosity 
enclosed within the cylinder of radius R cSf comes from regions at a 
radial distance r > R c ff from the centre of the galaxy. Any changes 
in the density distribution at radial distances larger than i?Einst or R c h 
which still influence the data model inside those projected radii will 
have an effect which is reflected statistically in the evidence (or in 
the likelihood), allowing to discriminate between different models. 
Clearly, the effect becomes progressively weaker with increasing 
radial distances. 

We also explored a constant-M/L model, by considering a 
double power-law density distribution which approximates the re- 
constructed luminous density profile of Fig.QT| this density profile 
is nearly isothermal in the inner regions but the slope becomes sig- 
nificantly steeper (y' out ~ 3.3) outwards. The M/L is 5.9 (M/L)q b . 
We find that the constant-M/L model, while able to reproduce the 
data, fits the dynamics somewhat worse: the x 2 P er pixel for the 
velocity dispersion is ~ 2.2, higher than in the case of the single 
power-law model (x 2 per pixel ~ 1.4). For the lensing and the sur- 
face brightness distribution, instead, the values of x 1 P er pixel of 
the two models are comparable (with the double power law being 
just a few per cent higher). In conclusion, under the considered as- 
sumptions, the nearly isothermal single power-law model is slightly 
preferred by the data over a constant-M/L model with no dark mat- 
ter. It should be noted, however, that the difference between the 
constant-M/L model and the power-law model (for which mass 
does not need to follow light) is relatively small. It is possible that 
much or all of this discrepancy could disappear if one allowed for 
more sophisticated three-integral dynamical models. 

4.4 Uncertainties 

The model uncertainties, that is, the scatter in the recovered non- 
linear parameters i, a , y' and q, were determined by considering 
128 random realizations of the datasets and rerunning the caul- 
dron algorithm for each of them. Table [3] summarizes the results 
of the statistical analysis and shows the 68 and 95 per cent con- 
fidence intervals for the four non-linear parameters. Fig.l 121 shows 



Two-dimensional kinematics ofSLACS lenses - 1. 



13 



Table 3. Summary of the model uncertainties, determined by considering 128 random realizations of the best model data sets. The 
reconstructed parameters obtained for the best model Albcst are a l so reported for comparison. 
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Figure 12. Graphical visualization of the lower triangle of the (symmetric) correlation matrix for the parameters recovered from the non-linear reconstruction 
of 128 random realizations of the best model data sets. The panels on the main diagonal present the distribution of the four non-linear parameters i, ag, y' and 
q over the realizations. In the remaining panels the non-linear parameters are plotted two by two against each other; for each panel, the corresponding value 
Pij of the correlation matrix is also indicated. The dotted vertical lines indicate the 68 per cent confidence interval. The red stars indicate the locations of the 
parameters obtained from the best model reconstruction Albest (cf. Table l4.2V 
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a graphical visualization of the correlation matrix, displaying the 
parameters plotted against each other, while the histograms on the 
main diagonal present the distribution of the four parameters over 
the realizations. These distributions, which provide the error bars 
for the best-fitting model parameters, can immediately be seen to 
be generally non-Gaussian. 

As highlighted in BK07, there appears to be a partial degener- 
acy between the inclination i and the axial ratio q, which is however 
limited only within a certain interval in the parameters. The lens 
strength a is quite tightly constrained, and the logarithmic slope 
7' spans over an interval of values all very close to the isothermal 
7' = 2 case. 



5 BREAKING DEGENERACIES: OBLATENESS AND 
INCLINATION 

When combining stellar kinematic and gravitational lensing data 
of an early-type galaxy, even under the simplifying assump- 
tion of spherical symmetry, it has been shown that the de- 



weighted stellar velocity dispersion integrated over the meridional 
plane of a galaxy with inclination i = is given by 



generacies between mass, orbital ani sotropy and total densit 



slope can effectively be broken (e.g . Treu & Koopmans 2002 
Koopmans et al.l2003l:lKoopmans & Treul2003l : lTreu & Koopmans 



20041 : iKoopmans et al J 1 20061) . More recently, the theoretical ba- 



sis of this methodology was made self-consistent and extended 
to include flattened axisymmetric mass distributions and the mod- 
elling of the stellar two-integral phase-space DF f(E, L-) (BK07). 
It was shown that under the assumptions of axisymmetry and two- 
integral DFs not only the degeneracy between mass, mass slope 
and anisotropy can be broken, but also that between the flattening 
(i.e. oblateness) of the mass distribution and its inclination with 
respect to an observer. The latter degeneracy poses a restriction 
on most current dynamical modelling effo rts, which often have 
to assume some (rang e of) inclination (e.g. lEmsellem et alj[2007l : 
ICappellari et alj2007h . 

By combining the information from lensing and dynam- 
ics we have found a well-defined mass slope (consistent with 
Koopmans et al. 2006), inclination and oblateness for the early- 
type lens galaxy SDSS J2321-097 (see e.g. Fig. I12t. In this sec- 
tion, we show why the degeneracy between oblateness and incli- 
nation can, in principle, be broken. As in all of this paper, we as- 
sume axisymmetry and restrict the phase-space density distribution 
to be a function of the two classical integrals of motion, E and 
L z (these assumptions hold well in the system considered in this 
paper). In this case, it can be shown from symmetry arguments 
(Bin nev & Tremaineiri 987) that at any point in the galaxy the two- 
dimensional cut through the stellar velocity ellipsoid in the merid- 
ional plane (R, z) is round (i.e. cr R = cr-). For an observer located in 
the meridional plane, the line-of-sight component of the stellar ve- 
locity dispersion from any given point on the same plane will there- 
fore appear independent of inclination. The luminosity-weighted 
integral of the stellar velocity dispersion over the entire meridional 
plane is then also independent of inclination. Because an observer 
is always situated in the meridional plane spanned by himself and 
the minor axis of an axisymmetric galaxy, we state that: 

The surface brightness weighted integral over the line-of-sight stellar ve- 
locity dispersion along the minor axis of an axisymmetric galaxy with a DF 
that depends only on energy and angular momentum, f(E, L z ), is indepen- 
dent of inclination. This also holds in the case of streaming motions which 
are perpendicular to the minor axis. 

This can be formally proved as follows. The luminosity- 
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(14) 



Under a rotation, denoted by the orthogonal rotational matrix R(i) 
with detR(/) = detR~'(i) = 1, the coordinate system can be trans- 
formed as x' = K(i)x with x = (R,z) and x' = (R',z')- The above 
equation can then, after coordinate transformation, be written as 



« P xo 



SZo rLP*( R '> z '*> detR-'(0dfl' dz' 



(15) 



Because rotation does not alter the line-of-sight velocity dispersion 
of a given point in the meridional plane if the potential is axisym- 
metric and the DF is only a function of E and L-, it is easy to see 
that crf 0S (R', f) = a^ os (R,z) for any inclination i. Also the scalar p* 
is invariant under rotation, hence p i ,(R',z') = p*(R,z)- Given the 
fact that detR~'(i) = 1, it immediately follows that 
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proving the above statement. Hence, (0"jL)(() effectively becomes a 
function only of the density profile, mass and flattening of the mass 
distribution. Since the former two are well constrained by gravita- 
tional lensing alone and even better in combination with the stel- 
lar kinematics, (cr^ p )(() reduces further to a function of mostly the 
oblateness: for a given mass and mass profile, a more oblate galaxy 
will have a larger value of (cr^ p >(i). Because g*,3D and inclination 
are restricted by the observed brightness distribution of the galaxy, 
and the oblateness by the value of (cr^ p ), one can solve for the in- 
clination and oblateness simultaneously. 

From the IFU data the value of (o"L,) can be inferred (because 
of the limited field of view, we cannot integrate over the entire mi- 
nor axis and (cr^ p ) might therefore still be a weak function of i) and 
in combination with the observed flattening of the galaxy bright- 
ness distribution, the inclination can be inferred. It is clear that this 
can only be done when the stellar velocity dispersion along the en- 
tire minor axis is known and that the inclination cannot be inferred 
from a total luminosity-weighted stellar velocity dispersion. 

We again emphasize here that the above result only holds in 
axisymmetric two-integral situations. To what extent it remains 
valid in cases where a third integral of motion is allowed (or ax- 
isymmetry is broken), remains a subject of further study. 



6 DISCUSSION AND CONCLUSIONS 

In this paper we have presented the first results from an integral- 
field spectroscopic survey of early-type lens galaxies from SLACS. 
The combination of integral-field spectroscopy from VIMOS/IFU 
mounted on the VLT with high-resolution imaging from HST/ ACS 
has enabled us to conduct the first in-depth study of the struc- 
ture of a luminous elliptical galaxy beyond the local Universe, 
SDSSJ2321-097 at z = 0.0819. We have applied a new analy- 
sis method that combines the kinematic and lensing information in 
a fully self-consistent way and have shown how this combination 
breaks some of the degeneracies that limit the separate application 
of these two methods. 

The galaxy that we have studied here turns out to be a 
fairly ordinary elliptical with properties similar to those of lo- 
cal galaxies of comparable luminosity, such as t hose studied by 
the SAURON collaboration (Ems ellem et alj2007l : ICappellari et al.l 
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2007b. SDSSJ2321-0 97 is a slow rotator in the classification of 
Emsell em et al, I d2007l) with an angular momentum parameter of 
/{r = 0.075. The velocity dispersion map is flat to the limit where 
we were able to measur e the kinematic parameters reliably. Using 
the updated estimator of lBinnevl d2005h for the ratio between sys- 
tematic and random velocities, v/cr v , that fully exploits the infor- 
mation in integral-field spectroscopic data, we have shown that the 
ellipticity of the stellar distribution in SDSS J232 1-097 is due to 
anisotropy of the velocity distribution rather than rotation, again in 
line w ith local galaxies of comparable luminosity ( ICappellari et al.1 
l2007h . 

We have modelled the galaxy by making use of the cauldron 
algorithm which self-consistently combines gravitational lensing 
and stellar dynamics under the assumption of axisymmetry and 
a two-integral DF. We adopted for the system the total gravita- 
tional potential generated by an axisymmetric power-law mass den- 
sity distribution of logarithmic slope y' and axial ratio q. The 
best-fitting model given the data is obtained in the framework of 
Bayesian statistics by maximizing the evidence merit function. The 
results for the best-fitting model are summarized as follows. 

(i) The logarithmic slope of the total density is y' = 2.06^"^ 
(the error is given within the 68 per cent confidence interval), which 
is very close to (and consistent with) an isothermal density distri- 
bution. 

(ii) The axial ratio of the total density distribution is q = 
0.74+jj jjj. Since in our approach mass is not required to follow light, 
this q does not have to coincide with the (average) axial ratio of the 
luminous distribution, which is calculated from the reconstructed 
DF, giving 5*30 = 0.85 for the best model. 

(iii) The inclination angle of the galaxy is i = 67°8+,g. The small 
error bar shows that inclination can be well determined in combi- 
nation with lensing data. 

(iv) The 'maximum bulge' approach, that is, the rescaling of 
the circularized stellar density profile which maximizes the con- 
tribution of the luminous component to the total density profile, 
prescribes a stellar mass M cff ^ 2.0 x 10" Mq inside the effec- 
tive radius, which corresponds to an average M/L of 5.2 (M/L)q s . 
The total mass enclosed in the same region is approximately 
2.9 x 10' 1 Mq : the non- visible matter, therefore, accounts for about 
30 per cent of the total mass within that three-dimensional radius. 

(v) The local (fJ v )/<r ratio on the meridional plane confirms that 
SDSS J232 1 -097 as a whole is a slow rotator, with the random mo- 
tions becoming less predominant compared to rotation only in the 
very central regions. The best model yields a global anisotropy pa- 
rameter S = 0.14, fully consistent with the value obtained directly 
from the data, showing that the galaxy is close to an isotropic rota- 
tor. The other global anisotropy parameters have values p = [as 
a consequence of having assumed a two-integral DF f(E, L-)] and 
y = -0.32, indicating a mild tangential anisotropy. 

These results a re in good agreement with the analysis of 
SDSSJ2321-097 bv lKoopmans et alj j2006l) , which we here ex- 
tend in a more rigorous way. In particular, the results confirm the es- 
sentially isothermal profile of the mass density distribution, which 
appears to be a defining characteristic of early-type galaxies. 

The best-fitting model is consistent with a dark matter frac- 
tion of 30 per cent within lOkpc (approximately corresponding to 
the un protected effective radius), similar to what ICappellari et al.1 
(2006) determine for the SAURON sample of early-type galaxies 
by making use of three-integral Schwarzschild dynamical models 
under the caveat that light traces mass. However, it seems plausi- 
ble that a constant-M/L model could still reproduce the observed 



kinematics of SDSS J232 1-097 if one allowed for a more flexible 
three-integral dynamical model rather than the two-integral model 
considered in this work. The relative difficulty in unambiguously 
discriminating between the constant-M/L model and the power- 
law model is also a consequence of the fact that for the specific 
lens system studied here, the mass within the Einstein radius is to 
a large extent dominated by the stellar mass, so that the difference 
between the two models does not show up dramatically in the lens 
model. The combined analysis of more distant objects, however, 
is expected to provide more unequivocal results in this respect. In 
general, for galaxies at higher redshift the ratio between the Ein- 
stein radius and the effe ctive radius becomes l arger (see e.g. the 
SLACS galaxy sample in lKoopmans et al]|20 06) and any deviation 
from a model for which mass follows light would become signif- 
icantly more prominent, since at least the surface brightness slope 
would be much steeper than what is allowed by the lensing data. 

The dynamical structure of SDSS J232 1-097 (i.e. anisotropy 
an d (t> M )/<T map) is also in good agreement with what is found 
by ICappellari etai] (2007) for the most massive ellipticals of the 
SAURON sample. 

The analysis has shown that the combination of gravitational 
lensing and stellar dynamics is a powerful method which allows 
the dissection in three dimensions of an elliptical galaxy (assumed 
to be well described as a two-integral axisymmetric dynamical 
system), breaking to a significant extent the classical degenera- 
cies between inclination and flattening as well as between mass 
and anisotropy. The way the degeneracy between inclination and 
oblateness is overcome can be understood within a simplified phys- 
ical picture: q and ( are coupled in the projected potential which en- 
ters in the description of gravitational lensing; when it comes to the 
dynamics, however, the surface brightness weighted integral along 
the minor axis over the line-of-sight stellar velocity dispersion is 
not a function of the inclination, since, due to the properties of the 
two-integral DF, the intersection of the velocity dispersion tensor 
with the (vg, v : ) plane is always a circle, that is, <x| = a 1 , for each 
position in the galaxy (see Section|5]l. 

We have shown that the method, in its implementation as the 
cauldron algorithm, is robust enough to make use of observational 
data in order to recover the non-linear parameters which character- 
ize the total gravitational potential and the geometry of the system 
(i.e. inclination, positional angle and lens centre) with relatively 
tight error bars (the confidence intervals shown in Table [3). This 
first application therefore shows promise for the future study of the 
other SLACS systems at higher redshift. 

In forthcoming papers in this series we will extend this work 
to the entire sample of 17 SLACS lenses with VLT VIMOS IFS, 
covering a range of lens galaxy morphology, mass and redshift (z = 
0.08 - 0.35). The VLT sample will be complemented by a further 
13 lenses for which we have obtained long-slit sp ectroscopy at the 
Low Resolution Imager and Spectrograph (LRIS. Ioke et al.iri995h 
on the Keck-I telescope. Several slit positions - aligned with the 
major axis and offset along the minor axis - have been obtained for 
each system in the Keck sample, thus effectively producing two- 
dimensional kinematic information across most of the lens galaxy. 
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